Disambiguation of Korean utterances using automatic intonation recognition
Abstract
This paper describes research on the use of intonation to disambiguate the utterance types of spoken Korean sentences. Based on tilt intonation theory [8], two related but separate experiments were performed, both using Hidden Markov Model training. In the first experiment, a system is built to detect the rough boundary positions of major intonation events. In the second, significant parameters are extracted from the output of the first experiment and used directly to train the final models for utterance type disambiguation. The results show that the intonation contour can serve as a significant cue for distinguishing meaning in automatic speech recognition of Korean, as it does in natural human communication.
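The abstract does not say which parameters are extracted from the detected intonation events. As a minimal sketch, assuming the usual tilt-model description of an event (total amplitude, total duration, and a tilt value between -1 and +1 computed from its rise and fall parts), the extraction step might look like the following. The `IntonationEvent` container, the `tilt_parameters` function, and the field names are hypothetical and are not taken from the paper.

```python
from dataclasses import dataclass


@dataclass
class IntonationEvent:
    """One detected intonation event (hypothetical container, not from the paper)."""
    rise_amp: float  # F0 change over the rise part, in Hz
    fall_amp: float  # F0 change over the fall part, in Hz (sign is ignored)
    rise_dur: float  # duration of the rise part, in seconds
    fall_dur: float  # duration of the fall part, in seconds


def _ratio(a: float, b: float) -> float:
    """Return (a - b) / (a + b), or 0.0 when both parts are zero."""
    total = a + b
    return (a - b) / total if total > 0 else 0.0


def tilt_parameters(ev: IntonationEvent) -> dict:
    """Compute amplitude, duration and tilt for one event.

    Assumption: tilt is the mean of the amplitude tilt and the duration
    tilt, so +1 is a pure rise, -1 a pure fall, and 0 a symmetric
    rise-fall.
    """
    rise_a, fall_a = abs(ev.rise_amp), abs(ev.fall_amp)
    tilt_amp = _ratio(rise_a, fall_a)
    tilt_dur = _ratio(ev.rise_dur, ev.fall_dur)
    return {
        "amplitude": rise_a + fall_a,
        "duration": ev.rise_dur + ev.fall_dur,
        "tilt": 0.5 * (tilt_amp + tilt_dur),
    }
```

Under these assumptions, a sentence-final pure rise (as might mark a question) yields a tilt of +1, while a pure fall yields -1. Per-event vectors of this kind could then serve as observations for the utterance-type models.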
Similar resources
PROSICE: A spoken English database for prosody research
Prosody, the study of the intonation, stress and rhythm of speech, is now assuming a greater importance in phonetics, phonology and speech technology than ever before. Once regarded as subservient to studies of segmental structure, it is now being seen as providing the ‘framework’ which holds different levels of phonetic description together. The recent past has seen novel views of the phonology ...
Automatic intonation classification for speech training systems
A prosodic Hidden Markov model (HMM) based modality recognizer has been developed, which, after supra-segmental acoustic pre-processing, can perform clause and sentence boundary detection and modality (sentence type) recognition. This modality recognizer is adapted to carry out automatic evaluation of the intonation of the produced utterances in a speech training system for hearing-impaired per...
The Role of Prosody in Disambiguating Potentially Ambiguous Utterances in English and Italian
We investigate the role that intonation plays in disambiguating potentially ambiguous utterances in English and Italian, to see a) whether speakers employ intonational means to disambiguate these utterances, and b) whether speakers of the two languages employ consistently different intonational strategies in this disambiguation. We find that, while some semantic phenomena are consistently dis...
Recognizing Verbal Irony in Spontaneous Speech
We explored the differential impact of auditory information and written contextual information on the recognition of verbal irony in spontaneous speech. Based on relevance theory, we predicted that speakers would provide acoustic disambiguation cues when speaking in situations that lack other sources of information, such as a visual channel. We further predicted that listeners would use this in...
Using prosody and phonotactics in Arabic dialect identification
While Modern Standard Arabic is the formal spoken and written language of the Arab world, dialects are the major communication mode for everyday life; identifying a speaker’s dialect is thus critical to speech processing tasks such as automatic speech recognition, as well as speaker identification. We examine the role of prosodic features (intonation and rhythm) across four Arabic dialects: Gul...